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Text Categorization: Unsupervised document classification using sequentis 
Noam Slonim, Nir Friedman, Naftali Tishby 

August 2002 Proceedings of the 25th annual international ACM SIGIR conference on 
retrieval 

Full text available:^ pdf(236.71 KB) Additional Information: full citation, abstract, referen 

We present a novel sequential clustering algorithm which is motivated by the In 
contrast to the agglomerative IB algorithm, the new sequential (sIB) approach i: 
maximum of the information with time and space complexity typically linear in t 
the original IB principle. Moreover, the time and space complexity are significan 
unsup ... 
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2 Web clustering and usage mining: Clustering documents in a web directory 
Giordano Adami, Paolo Avesani, Diego Sona 

November 2003 Proceedings of the fifth ACM international workshop on Web inforrr 

Full text available:^ pdf(180.53 KB) Additional Information: full citation, abstract, referenc 

Hierarchical categorization of documents is a task receiving growing interest due 
hierarchies for text documents. The worst problem of hierarchical supervised clc 
labeled examples, whose amount is related to the number of topics in the taxon 
hierarchy with a proper set of labeled examples is a critical issue. In this paper, 
bootstrapping problem, imp ... 

Keywords: TaxSOM, constrained clustering, digital libraries, k-means, knowledge 
process, text categorization, web directories 



3 Knowledge management session 4: indexing: Bootstrapping for hierarchica 
Giordano Adami, Paolo Avesani, Diego Sona 

November 2003 Proceedings of the twelfth international conference on Information 

Full text available:^ pdf(180.73 KB) Additional Information: full citation, abstract, referen 

Managing the hierarchical organization of data is starting to play a key role in th 
due to the great amount of human resources needed to create and maintain the 
Machine learning community has in part addressed this problem by developing \ 
maintainers to categorize new resources within given hierarchies. Although such 
relational knowledge, they ... 

Keywords: TaxSOM, constrained clustering, k-means, taxonomy bootstrapping p 



4 Special issue on special feature: Distributional word clusters vs. words for t 
Ron Bekkerman, Ran El-Yaniv, Naftali Tishby, Yoad Winter 
March 2003 The Journal of Machine Learning Research, Volume 3 

Full text available: ^ pdf(1 76.53 KB) Additional Information: full citation, ? 

We study an approach to text categorization that combines distributional cluster 
Machine (SVM) classifier. This word-cluster representation is computed using th< 
Bottleneck method, which generates a compact and efficient representation of d 
classification power of the SVM, this method yields high performance in text cab 
SVM with word-cluster representation ... 
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5 Knowledge Management: Organizing What You Know: FOCI: flexible orgar 
Hwee-Leng Ong, Ah-Hwee Tan, Jamie Ng, Hong Pan, Qiu-Xiang Li 
October 2001 proceedings of the tenth international conference on Information anc 

Full text available:^ pdf(1 .27 MB) Additional Information: full citation, abstract, ref< 

This paper describes how an integrated web-based application, code-named FO( 
Intelligence), can help the knowledge worker in the gathering, organizing, track 
intelligence or knowledge bases on the web. It shows how text mining technique 
clustering, trend analysis and visualization techniques can be used synergisticall 
information gathered from the web ... 

Keywords: clustering, competitive intelligence, personalization, text mining, trer 



6 Identifying and testing of signatures for non-volatile biomolecules using tan 
Ray R. Hashemi, Theresa M. Schafer, William G. Hinson, Jackson 0. Lay 
February 1996 Proceedings of the 1996 ACM symposium on Applied Computing 
Full text available:^ pdf(626.77 KB) Additional Information: full citation, references, index ter 



Keywords: Kohonen self organizing map, classification, pattern generation, tand 



7 Intrusion and privacy: Exploiting unlabeled data in ensemble methods 
Kristin P. Bennett, Ayhan Demiriz, Richard Maclin 

July 2002 Proceedings of the eighth ACM SIGKDD international conference on Knov 

Full text available: ^ pdf(71 9.46 KB) Additional Information: full citation, abstract, referen 

An adaptive semi-supervised ensemble method, ASSEMBLE, is proposed that co 
both labeled and unlabeled data. ASSEMBLE alternates between assigning "pseu 
the existing ensemble and constructing the next base classifier using both the la 
Mathematically, this intuitive algorithm corresponds to maximizing the classifica 
measured on both the labeled and unlabel ... 

Keywords: boosting, classification, ensemble learning, semi-supervised learning 
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8 Special issue on special feature: An introduction to variable and feature sel 
Isabelle Guyon, Andre Elisseeff 

March 2003 The Journal of Machine Learning Research, Volume 3 

Full text available: ^ pdf(862.82 KB) Additional Information: full citatioi 

Variable and feature selection have become the focus of much research in areas 
tens or hundreds of thousands of variables are available. These areas include te 
expression array analysis, and combinatorial chemistry. The objective of variable 
prediction performance of the predictors, providing faster and more cost-effecti\ 
understanding of the ... 

9 Data clustering: a review 

A. K. Jain, M. N. Murty, P. J. Flynn 

September 1999 ACM Computing Surveys (CSUR), Volume 31 Issue 3 

Full text available:^ pdf(636.24 KB) Additional Information: full citation, abstract, references, citir 

Clustering is the unsupervised classification of patterns (observations, data item 
(clusters). The clustering problem has been addressed in many contexts and by 
reflects its broad appeal and usefulness as one of the steps in exploratory data < 
problem combinatorially, and differences in assumptions and contexts in differer 
useful generic co ... 

Keywords: cluster analysis, clustering applications, exploratory data analysis, in> 
unsupervised learning 



10 The exploration of legal text corpora with hierarchical neural networks: a gi 
Dieter Merkl, Erich Schweighofer 

June 1997 Proceedings of the sixth international conference on Artificial intelligei 

Full text available: ^ pdf(1 .12 MB) Additional Information: full citation, references, citings, 



11 Data mining solves tough semiconductor manufacturing problems 
Mike Gardner, Jack Bieker 

August 2000 Proceedings of the sixth ACM SIGKDD international conference on Knov 

Full text available:^ pdf(2.27 MB) Additional Information: full citation, references, index 



Keywords: data mining, machine learning, manufacturing optimization, neural n 
induction, self organizing maps, semiconductor yield enhancement 
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12 Exploration of text collections with hierarchical feature maps 
Dieter Merkl 

July 1997 ACM SIGIR Forum , Proceedings of the 20th annual international ACM SIC 
development in information retrieval, Volume 31 Issue SI 

Full text available: m pdf(1 .65 MB) Additional Information: full citation, references, citing: 



13 Image annotation and video summarization: Temporal event clustering for < 
Matthew Cooper, Jonathan Foote, Andreas Girgensohn, Lynn Wilcox 
November 2003 Proceedings of the eleventh ACM international conference on K 
Full text available:^ pdf(200.13 KB) Additional Information: full citation, abstract, refereno 

We present similarity-based methods to cluster digital photos by time and imag< 
unsupervised, and makes minimal assumptions regarding the structure or statis 
results for the algorithm based solely on temporal similarity, and jointly on temp 
also describe a supervised algorithm based on learning vector quantization. Fina 
the proposed algorithms and ... 

Keywords: digital photo organization, temporal media indexing and segmentatio 



14 Performance comparison of neural networks and pattern recognition techni 
transducers 

M. S. Obaidat, D. S. Abu-Saymeh 

March 1992 Proceedings of the 1992 ACM/SIGAPP symposium on Applied computing 

Full text available:^ pdf(1 .03 MB) Additional Information: full citation, references, in 



15 Supervised adaptive resonance networks 
R. S. Baxter 

May 1991 Proceedings of the conference on Analysis of neural network applications 
Full text available:^) pdf(1 .44 MB) Additional Information: full citation, references, index terms 
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16 Integrating automatic genre analysis into digital libraries 
Andreas Rauber, Alexander Muller-Kogler 

January 2001 Proceedings of the first ACM/IEEE-CS joint conference on Digital 
Full text available:^ pdf(672.98 KB) Additional Information: full citation, abstract, referenc 

With the number and types of documents in digital library systems incr easing, t 
presenting the content have to be found. While many approaches focus on topic 
hardly any system incorporates automatic structural analysis and representatior 
forms one of the most distinguishing features in conventional libraries and in inf 
present an approach to au ... 

Keywords: SOMLib, document clustering, genre analysis, metaphor graphics, se 



17 Book reviews 
Karen T. Sutherland 

December 2000 intelligence, Volume 11 Issue 4 

Full text available:^ pdf(357.20 KB) ||) html(39.65 KB) Additional Information: full citation, references, index te 



18 Experiences with criticality predictions in software development 
Christof Ebert 

November 1997 ACM SIGSOFT Software Engineering Notes , Proceedings of the 6th I 
5th ACM SIGSOFT international symposium on Foundations of softw; 

Full text available:^ pdf(1 .36 MB) Additional Information: full citation, references, citing: 



Keywords: classification, criticality prediction, data analysis, eomplexity, quality 



19 Position papers: Artificial neural networks: a science in trouble 

Asim Roy 

January 2000 ACM SIGKDD Explorations Newsletter, Volume 1 Issue 2 

Full text available:^ pdf(646.93 KB) Additional Information: full citation, abstract, 

This article points out some very serious misconceptions about the brain in conr 
Some of the connectionist ideas have been shown to have logical flaws, while ot 
commonly observed human learning processes and behavior. For example, the < 
provision for learning from stored information, something that humans do all th< 
is definitely a need for some ne ... 

Keywords: artificial neural networks, automated learning, brain-like learning, co 
systems 
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20 Summarization: The use of unlabeled data to improve supervised learning 1 

Massih-Reza Amini, Patrick Gallinari 

August 2002 Proceedings of the 25th annual international ACM SIGIR conference on 



With the huge amount of information available electronically, there is an increas 
summarization systems. The use of machine learning techniques for this task all 
needs and to the corpus characteristics. These desirable properties have motiva 
field over the last few years. Most approaches attempt to generate summaries t 
adopt the supervised learning paradigm ... 

Keywords: machine learning, semi-supervised learning, text summarization, tex 
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21 F3MCNN: a fuzzy minimum mean maximum clustering neural network 
Liangtsan G. Wu, Huizhu Lu 

March 2000 Proceedings of the 2000 ACM symposium on Applied computing 

Full text available:® pdf(399.52 KB) Additional Information: full citation, references, index terms 



Keywords: ART, F3MCNN, clustering, fuzzy logic, neural network 



22 Think globally, fit locally: unsupervised learning of low dimensional manifok 
Lawrence K. Saul, Sam T. Roweis 

September 2003 The Journal of Machine Learning Research, Volume 4 

Full text available:^) pdf(2.91 MB) Additional Information: full citation, £ 

The problem of dimensionality reduction arises in many fields of information pro 
compression, scientific visualization, pattern recognition, and neural computatio 
embedding (LLE), an unsupervised learning algorithm that computes low dimen; 
embeddings of high dimensional data. The data, assumed to be sampled from a 
single global coordinate system of lowe ... 
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23 Fuzzy neural network models for clustering 
A. D. Kulkarni, V. K. Muniganti 

February 1996 Proceedings of the 1996 ACM symposium on Applied Computing 
Full text available:^ pdf(850.43 KB) Additional Information: full citation, references, index terms 



24 Identifying prospective customers 

Paul B. Chou, Edna Grossman, Dimitrios Gunopulos, Pasumarti Kamesam 

August 2000 Proceedings of the sixth ACM SIGKDD international conference on Knov 

Full text available:^ pdf(170.89 KB) Additional Information: full citation, references, ind 



Keywords: customer prospecting 



25 Information filtering: the computation of similarities in large corpora of legal 
Erich Schweighofer, Werner Winiwarter, Dieter Merkl 

May 1995 Proceedings of the fifth international conference on Artificial intelligence c 

Full text available:^ pdf(748.19 KB) Additional Information: full citation, references, citings, index terms 



26 Cluster ensembles — a knowledge reuse framework for combining multiple 
Alexander Strehl, Joydeep Ghosh 

March 2003 The Journal of Machine Learning Research, Volume 3 

Full text available:^ pdf(842.50 KB) Additional Information: full citation, abstract, referem 

This paper introduces the problem of combining multiple partitionings of a set ol 
clustering without accessing the features or algorithms that determined these p< 
application scenarios for the resultant 'knowledge reuse' framework that we call 
problem is then formalized as a combinatorial optimization problem in terms of : 
a direct ... 

Keywords: cluster analysis, clustering, consensus functions, ensemble, knowled< 
information, partitioning, unsupervised learning 
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27 Variational learning of clusters of undercomplete nonsymmetric independei 
Kwokleung Chan, Te-Won Lee, Terrence J. Sejnowski 

March 2003 The Journal of Machine Learning Research, Volume 3 

Full text available: 1 ® pdf(345.39 KB) Additional Information: full citation, abstract, references, 

We apply a variational method to automatically determine the number of mixtur 
high-dimensional datasets, in which the sources may be nonsymmetrically distri 
where each cluster is described as a linear mixture of independent factors. The ' 
accurate density model for the observed data without overfitting problems. This 
be identified for each cluster. Th ... 

Keywords: Bayesian learning, ICA, density estimations, mixture models 

28 Fast supervised dimensionality reduction algorithm with applications to doc 
George Karypis, Eui-Hong (Sam) Han 

November 2000 Proceedings of the ninth international conference on Information ar 
Full text available:® pdf(270.71 KB) Additional Information: full citation, references, in« 



29 Task clustering and gating for bayesian multitask learning 
Bart Bakker, Tom Heskes 

September 2003 The Journal of Machine Learning Research, Volume 4 

Full text available: "@ pdf(229.33 KB) Additional Information: full citation, 

Modeling a collection of similar regression or classification tasks can be improve! 
other'. In machine learning, this subject is approached through 'multitask learnii 
multiple outputs of the same network. In multilevel analysis this is generally im| 
linear model where a distinction is made between 'fixed effects', which are the s 
which may vary bet ... 

30 Scalable feature selection, classification and signature generation for orgar 
hierarchical topic taxonomies 

Soumen Chakrabarti, Byron Dom, Rakesh Agrawal, Prabhakar Raghavan 

August 1998 The VLDB Journal — The International Journal on Very Large D 

Full text available:^ pdf(281 .37 KB) Additional Information: full citation, abstract, citings 

We explore how to organize large text databases hierarchically by topic to aid b< 
Many corpora, such as internet directories, digital libraries, and patent database 
hierarchies, also called taxonomies. Similar to indices for relational data, taxono 
efficient. However, the exponential growth in the volume of on-line textual infor 
maintain such taxono ... 
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31 Industry track papers: Mining heterogeneous gene expression data with tirr 
Yulan Liang, Arpad Kelemen 

July 2002 proceedings of the eighth ACM SIGKDD international conference on Knov 
Full text available: pdf(1 .48 MB) Additional Information: full citation, abstract, reference 

Heterogeneous types of gene expressions may provide a better insight into the 
environment, disease development and drug effect at the molecular level. In thi 
purposes a Time Lagged Recurrent Neural Network with trajectory learning is pr 
gene functional patterns from the heterogeneous nonlinear time series microarr 
identify gene fun ... 

Keywords: backpropagation through time, gene expression, heterogeneous, tim 
learning 



32 A parallel topological feature map in APL 

J. Frey, D. Scheppelmann, G.-P. Glombitza, H. Meinzer 

September 1993 ACM SIGAPL APL Quote Quad , Proceedings of the international cor 

Full text available: "HI pdf(467.63 KB) Additional Information: full citation, abstract, referen 

One can distinguish two different approaches of neural networks the supervised 
unsupervised neural networks. The first type of neural nets is supplied with an u 
learning procedure, the neural net adjusts weighting factors of the links betweei 
vectors map to the ideal output. Those nets are used for example in robotics, wl 
the position the robot ... 

33 Spatiotemporal Databases: Learning geoscience categories in Situ: implies 
representation 

Boyan Brodaric, Mark Gahegan 

November 2001 Proceedings of the ninth ACM international symposium on Advances 
Full text available:^ pdf(2.24 MB) Additional Information: full citation, abstract, reference 

This paper explores the development of categories shared in the field logging of 
Visualization, neural networks and spatial statistical tools are employed to gain i 
attributes observed, and into the categories developed. Background material an 
findings in the light of research into category development, and specifically how 
modified as part of the (geo)s ... 

Keywords: category development, classification, geological fieldwork, informatio 
learning 
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34 Document clustering using word clusters via the information bottleneck mel 

Noam Slonim, Naftali Tishby 

July 2000 Proceedings of the 23rd annual international ACM SIGIR conference on R< 
retrieval 

Full text available:^ pdf(854.89 KB) Additional Information: full citation, abstract, references, 

We present a novel implementation of the recently introduced in 
unsupervised document clustering. Given a joint empirical distril 
p(x, y), we first cluster the words, Y, so that the obtained word 
preserve the information on the documents. The resulting joint < 
contains most of the original information about the documents, 

35 Survey articles: Data mining for hypertext: a tutorial survey 
Soumen Chakrabarti 

January 2000 ACM SIGKDD Explorations Newsletter, Volume 1 Issue 2 

Full text available:^ pdf(1 .19 MB) Additional Information: full citation, abstract, refere 

With over 800 million pages covering most areas of human endeavor, the World 
mining research to make a difference to the effectiveness of information search, 
through two dominant interfaces: clicking on hyperlinks and searching via keywi 
and unsatisfactory. Better support is needed for expressing one's information n* 
more structured ways than av ... 

36 Posters: Image classification using hybrid neural networks 
Chih-Fong Tsai, Ken McGarry, John Tait 

July 2003 Proceedings of the 26th annual international ACM SIGIR conference on Rt 
retrieval 

Full text available:^ pdf(1 99.31 KB) Additional Information: full citation, abstract, referen 

Use of semantic content is one of the major issues which needs to be addressed 
effectiveness. We present a new approach to classify images based on the comt 
and hybrid neural networks. Multiple keywords are assigned to an image to repr 
content. Images are divided into a number of regions and colour and texture fee 
self-organising map (SOM) clusters ... 

Keywords: content-based image retrieval, image indexing/classification, neural i 
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37 Clustering algorithms: Alternatives to the k-means algorithm that find better 
Greg Hamerly, Charles Elkan 

November 2002 Proceedings of the eleventh international conference on Informatio 
Full text available: @ pdf(1 .32 MB) Additional Information: full citation, abstract, reference: 

We investigate here the behavior of the standard k-means clustering algorithm ; 
k-harmonic means algorithm due to Zhang and colleagues, fuzzy k-means, Gau; 
new variants of k-harmonic means. Our aim is to find which aspects of these ale 
clusterings, as opposed to converging to a low-quality local optimum. We descri 
that introduces separate cluster membe ... 

Keywords: clustering quality, k-harmonic means, k-means, unsupervised classif 



38 Machine learning in automated text categorization 
Fabrizio Sebastiani 

March 2002 ACM Computing Surveys (CSUR), Volume 34 Issue 1 

Full text available:^ pdf(524.41 KB) Additional Information: full citation, abstract, references, 

The automated categorization (or classification) of texts into predefined categor 
the last 10 years, due to the increased availability of documents in digital form < 
the research community the dominant approach to this problem is based on ma< 
inductive process automatically builds a classifier by learning, from a set of prec 
the categories. ... 

Keywords: Machine learning, text categorization, text classification 



39 High performance data mining (tutorial PM-3) 
Vipin Kumar, Mohammed Zaki 

August 2000 Tutorial notes of the sixth ACM SIGKDD international conference on Kn 

Full text available:^ pdf(8.06 MB) Additional Information: full citation, references, inde 
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40 A.I. and computational logic: An effective document clustering method usin 

Han-joon Kim, Sang-goo Lee 

March 2002 Proceedings of the 2002 ACM symposium on Applied computing 

Full text available:^ pdf(478.99 KB) Additional Information: full citation, abstract, referen 

Document clustering is inherently an unsupervised learning process that organi2 
groups without depending on pre-specified knowledge. However, real-world app 
hierarchy for a large document collection, need to perform clustering under vari> 
presents a new type of supervised clustering to organize information in a way tr 
As a means by which externa ... 

Keywords: document clustering, hierarchical clustering, information organizatior 
knowledge 
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41 Special issue on special feature: Sufficient dimensionality reduction 
Amir Globerson, Naftali Tishby 

The Journal of Machine Learning Research, Volume 3 



March 2003 



Full text available: 1§ pdf(266.18 KB) 



Additional Information: full citatioi 



Dimensionality reduction of empirical co-occurrence data is a fundamental probl 
well studied problem in statistics known as the analysis of cross-classified data, 
to represent the data in low dimension with minimal loss of (mutual) informatior 
paper we introduce an information theoretic nonlinear method for finding such a 
contrast wi ... 
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42 Automated revsion of GIS databases 
Volker Walter, Dieter Fritsch 

November 2000 Proceedings of the eighth ACM international symposium on Advance 

Full text available:^) pdf(703.89 KB) Additional Information: full citation, abstract, 

Digital spatial data are underlying strong temporal changes. The 
these changes is to check the data manually for their correctnes 
up-to-date orthophotos. The update cycles of large data sets an 
present shorter update cycles are unrealizable for two reasons. " 
is very cost- and time-consuming and aerial photographs for Ian 
available in the needed ... 

Keywords: ATKIS, GIS, classification, matching, remote sensing 



43 Self organizing neural networks with a split/merge algorithm 
A. D. Kulkarni, G. M. Whitson 

February 1990 Proceedings of the 1990 ACM SIGSMALL/PC symposium on Sma 

Full text available:^! pdf(525.1 3 KB) Additional Information: full citation, abstract, referee 

In this paper we present a new learning algorithm for Artificial Neural Networks 
ANN model with the new algorithm has been developed and tested on a PC. The 
input patterns, and identifies the number of categories present in the input sam 
competitive learning algorithm. Unlike the competitive learning algorithm, in thi: 
long term weights ( ... 

^Special issue on ICML: Coupled clustering: a method for detecting structun 
Zvika Marx, Ido Dagan, Joachim M. Buhmann, Eli Shamir 
March 2003 The Journal of Machine Learning Research, Volume 3 

Full text available:^ pdf(967.1 5 KB) Additional Information: full citatior 

This paper proposes a new paradigm and a computational framework for reveali 
sub-structures of distinct composite systems that are initially represented by un 
introduce and investigate a variant of traditional data clustering, termed couplec 
of corresponding subsets of two such representative sets. We apply our method 
achievement ... 
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45 An empirical comparison of supervised machine learning techniques in bioi 
Aik Choon Tan, David Gilbert 

January 2003 Proceedings of the First Asia-Pacific bioinformatics conference on Bio 

Full text available:^) pdf(234.35 KB) Additional Information: full citation, abstract, referen 

Research in bioinformatics is driven by the experimental data. Current biological 
amounts of experimental data. Machine learning has been widely applied to bioii 
in this research area. At present, with various learning algorithms available in tr 
difficulties in choosing the best method that can apply to their data. We perform 
learning systems and 9 dif ... 

Keywords: bioinformatics, ensemble methods, performance evaluation, supervis 



46 Regularized principal manifolds 

Alexander J. Smola, Sebastian Mika, Bernhard Scholkopf, Robert C. Williamson 
September 2001 The Journal of Machine Learning Research, Volume 1 

Full text available:^ pdf(770.51 KB) Additional Information: full citatior 

Many settings of unsupervised learning can be viewed as quantization problems 
quantization error subject to some restrictions. This allows the use of tools such 
(supervised) risk minimization for unsupervised learning. This setting turns out I 
the generative topographic map, and robust coding. We explore this connection i 
for finding principal man ... 

47 Bioinformatics: Gene functional classification by semi-supervised learning f 
Tao Li, Shenghuo Zhu, Qi Li, Mitsunori Ogihara 

March 2003 Proceedings of the 2003 ACM symposium on Applied computing 

Full text available:^ pdf(601 .62 KB) Additional Information: full citation, abstract, r 

Gene function discovery is an important and interesting problem in computation 
paper, we investigate the use of a semi-supervised learning algorithm for inferri 
heterogeneous data set consisting of DNA microarray expression measurements 
whole-genome sequence compassions. The semisupervised learning approach a 
between individual models built from each sepa ... 

Keywords: Support Vector Machine(SVM), gene functional classification, heteroc 
semi-supervised 
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48 Automatic text representation, classification and labeling in European law 
Erich Schweighofer, Andreas Rauber, Michael Dittenbach 

May 2001 Proceedings of the 8th international conference on Artificial intelligenc 

Full text available:^ pdf(265.20 KB) Additional Information: full citation, abstract, referee 

The huge text archives and retrieval systems of legal informatioi 
representation in the well-known subject-oriented structure of l€ 
Content-based classification and text analysis remains a high pri 
KONTERM, SOM and LabelSOM projects, learning techniques of i 
achieve similar high compression rates of classification and anal' 
The produced maps of legal text co ... 

49 Feature selection in unsupervised learning via evolutionary search 
YeongSeog Kim, W. Nick Street, Filippo Menczer 

August 2000 Proceedings of the sixth ACM SIGKDD international conference on Knov 

Full text available:^ pdf(2 13. 12 KB) Additional Information: full citation, references, ind 



Keywords: clustering, evolutionary search, feature selection 



50 A QoS-Provisioning neural fuzzy connection admission controller for multirr 
Ray-Guang Cheng, Chung-Ju Chang, Li-Fong Lin 

February 1999 IEEE/ACM Transactions on Networking (TON), Volume 7 Issue : 

Full text available:^) pdf(342.90 KB) Additional Information: full citation, references, citings, in< 



51 Special issue on kernel methods: Support vector clustering 
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